
Zhi Zhang

TriLens: Per-Layer Logit-Lens Entropy for White-Box Hallucination Detection

May 31, 2026
Via arXiv

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

May 26, 2026
Via arXiv

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

Apr 01, 2026
Via arXiv

veScale-FSDP: Flexible and High-Performance FSDP at Scale

Feb 25, 2026
Via arXiv

ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns

Feb 17, 2026
Via arXiv

Train Less, Learn More: Adaptive Efficient Rollout Optimization for Group-Based Reinforcement Learning

Feb 15, 2026
Via arXiv

AI-Native 6G Physical Layer with Cross-Module Optimization and Cooperative Control Agents

Jan 07, 2026
Via arXiv

Virtual Width Networks

Nov 17, 2025
Via arXiv

Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research

Oct 24, 2025
Via arXiv

NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning

Oct 21, 2025
Via arXiv